AITopics | bitter lesson

Collaborating Authors

bitter lesson

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Pre-training isn't bitter enough

AIHubJul-10-2026, 08:42:06 GMT

Richard Sutton's "Bitter Lesson" is usually read as a warning against building too much human knowledge into AI systems. Over the long run, the methods that win are not the ones that encode our clever intuition most directly, but the ones that scale: search, learning, and other general methods that can absorb more compute and data. We take a general architecture, expose it to massive data, and train it with a simple self-supervised objective. Language models predict the next token. Vision models reconstruct masked patches, align views, or match teacher representations.

artificial intelligence, learner, machine learning, (16 more...)

AIHub

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

The bitter lesson of misuse detection

Mariaccia, Hadrien, Segerie, Charbel-Raphaël, Dorn, Diego

arXiv.org Artificial IntelligenceJul-10-2025

Prior work on jailbreak detection has established the importance of adversarial robustness for LLMs but has largely focused on the model ability to resist adversarial inputs and to output safe content, rather than the effectiveness of external supervision systems. The only public and independent benchmark of these guardrails to date evaluates a narrow set of supervisors on limited scenarios. Consequently, no comprehensive public benchmark yet verifies how well supervision systems from the market perform under realistic, diverse attacks. To address this, we introduce BELLS, a Benchmark for the Evaluation of LLM Supervision Systems. The framework is two dimensional: harm severity (benign, borderline, harmful) and adversarial sophistication (direct vs. jailbreak) and provides a rich dataset covering 3 jailbreak families and 11 harm categories. Our evaluations reveal drastic limitations of specialized supervision systems. While they recognize some known jailbreak patterns, their semantic understanding and generalization capabilities are very limited, sometimes with detection rates close to zero when asking a harmful question directly or with a new jailbreak technique such as base64 encoding. Simply asking generalist LLMs if the user question is "harmful or not" largely outperforms these supervisors from the market according to our BELLS score. But frontier LLMs still suffer from metacognitive incoherence, often responding to queries they correctly identify as harmful (up to 30 percent for Claude 3.7 and greater than 50 percent for Mistral Large). These results suggest that simple scaffolding could significantly improve misuse detection robustness, but more research is needed to assess the tradeoffs of such techniques. Our results support the "bitter lesson" of misuse detection: general capabilities of LLMs are necessary to detect a diverse array of misuses and jailbreaks.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2507.06282

Country: Europe > Switzerland (0.14)

Genre: Research Report > New Finding (1.00)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

What the F*ck Is Artificial General Intelligence?

Bennett, Michael Timothy

arXiv.org Artificial IntelligenceMar-31-2025

Artificial general intelligence (AGI) is an established field of research. Yet Melanie Mitchell and others have questioned if the term still has meaning. AGI has been subject to so much hype and speculation it has become something of a Rorschach test. Mitchell points out that the debate will only be settled through long term, scientific investigation. To that end here is a short, accessible and provocative overview of AGI. I compare definitions of intelligence, settling on intelligence in terms of adaptation and AGI as an artificial scientist. Taking my queue from Sutton's Bitter Lesson I describe two foundational tools used to build adaptive systems: search and approximation. I compare pros, cons, hybrids and architectures like o3, AlphaGo, AERA, NARS and Hyperon. I then discuss overall meta-approaches to making systems behave more intelligently. I divide them into scale-maxing, simp-maxing, w-maxing based on the Bitter Lesson, Ockham's and Bennett's Razors. These maximise resources, simplicity of form, and the weakness of constraints on functionality. I discuss examples including AIXI, the free energy principle and The Embiggening of language models. I conclude that though scale-maxed approximation dominates, AGI will be a fusion of tools and meta-approaches. The Embiggening was enabled by improvements in hardware. Now the bottlenecks are sample and energy efficiency.

artificial general intelligence, large language model, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2503.23923

Country:

North America > United States > Texas (0.04)
Europe > Iceland > Capital Region > Reykjavik (0.04)

Genre: Research Report (0.41)

Industry:

Information Technology (0.48)
Leisure & Entertainment > Games > Go (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.68)

Add feedback

Learning the Bitter Lesson: Empirical Evidence from 20 Years of CVPR Proceedings

Yousefi, Mojtaba, Collins, Jack

arXiv.org Artificial IntelligenceOct-12-2024

This study examines the alignment of \emph{Conference on Computer Vision and Pattern Recognition} (CVPR) research with the principles of the "bitter lesson" proposed by Rich Sutton. We analyze two decades of CVPR abstracts and titles using large language models (LLMs) to assess the field's embracement of these principles. Our methodology leverages state-of-the-art natural language processing techniques to systematically evaluate the evolution of research approaches in computer vision. The results reveal significant trends in the adoption of general-purpose learning algorithms and the utilization of increased computational resources. We discuss the implications of these findings for the future direction of computer vision research and its potential impact on broader artificial intelligence development. This work contributes to the ongoing dialogue about the most effective strategies for advancing machine learning and computer vision, offering insights that may guide future research priorities and methodologies in the field.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2410.09649

Country:

North America > United States > Massachusetts > Suffolk County > Boston (0.04)
North America > United States > California > San Mateo County > Menlo Park (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(2 more...)

Add feedback

The real "Bitter Lesson" of artificial intelligence – TechTalks

#artificialintelligenceDec-7-2022, 08:59:07 GMT

In a popular blog post titled "The Bitter Lesson," Richard Sutton argues that AI's progress has resulted from cheaper computation, not human design decisions based on problem-specific information. Sutton diminishes researchers that build knowledge into solutions based on their understanding of a problem to improve performance. This temptation, Sutton explains, is good for short-term performance gains, and such vanity is satisfying to the researcher. However, such human ingenuity comes at the expense of AI's divine destiny by inhibiting the development of a solution that doesn't want our help understanding a problem. AI's goal is to recreate the problem-solver ex nihilo, not to solve problems directly.[1]

bitter lesson, human ingenuity, sutton, (14 more...)

#artificialintelligence

Country: North America > United States > New York (0.05)

Industry: Leisure & Entertainment > Games > Chess (0.99)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Thoughts: Sutton's The Bitter Lesson

#artificialintelligenceMay-10-2022, 00:55:04 GMT

It states that general learning methods that can scale with computation are ultimately the most effective. The two methods that can seemingly scale endlessly are search and learning, and they have bore their fruit. Sutton lists out their successes in chess, go, speech recognition, computer vision, etc, etc. This is in contrast to the human-knowledge approach, where our knowledge of a specific domain is built into the algorithms that are trying to "solve" or "work-out", so to speak, that domain. In speech recognition, this was with the hand crafting of phonemes, words, etc; in games like chess/go this was through crafting for features of the game; and the list goes on and on.

bitter lesson, knowledge, sutton, (8 more...)

#artificialintelligence

Industry: Leisure & Entertainment > Games > Chess (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

The Bitter Lesson of Machine Learning - KDnuggets

#artificialintelligenceJul-16-2020, 22:11:05 GMT

The biggest lesson that can be read from 70 years of AI research is that general methods that leverage computation are ultimately the most effective, and by a large margin. The ultimate reason for this is Moore's law, or rather its generalization of continued exponentially falling cost per unit of computation. Most AI research has been conducted as if the computation available to the agent were constant (in which case leveraging human knowledge would be one of the only ways to improve performance) but, over a slightly longer time than a typical research project, massively more computation inevitably becomes available. Seeking an improvement that makes a difference in the shorter term, researchers seek to leverage their human knowledge of the domain, but the only thing that matters, in the long run, is the leveraging of computation. These two need not run counter to each other, but in practice, they tend to.

artificial intelligence, computation, machine learning, (16 more...)

#artificialintelligence

Country: North America > Canada > Alberta (0.06)

Industry: Leisure & Entertainment > Games (0.56)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.32)

Add feedback